Finding Regressional Outliers by Dynamic Projections

نویسنده

  • Anna Bartkowiak
چکیده

Atypical observations hidden in the data may play quite an disastrous role in a tted regression, especially when commonly used outlier detection techniques like computing leverages, Mahalanobis distances, ordinary and studentized residuals, DFFits, cross-validations { do not detect them. However (multivariate) outliers can be detected quite easily by graphical techniques , e.g. scatterplot matrices, spin plots or dynamic projections using the grand tour method. We nd here specially useful the grand tour. When we know about the outliers and their location in the multivariate space, then we may account for them by special regression models. We illustrate our considerations using the Modiied Wood Gravity data from Rousseeuw and Leroy (1987).

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Identification of outliers types in multivariate time series using genetic algorithm

Multivariate time series data, often, modeled using vector autoregressive moving average (VARMA) model. But presence of outliers can violates the stationary assumption and may lead to wrong modeling, biased estimation of parameters and inaccurate prediction. Thus, detection of these points and how to deal properly with them, especially in relation to modeling and parameter estimation of VARMA m...

متن کامل

Robust Subspace Outlier Detection in High Dimensional Space

Rare data in a large-scale database are called outliers that reveal significant information in the real world. The subspace-based outlier detection is regarded as a feasible approach in very high dimensional space. However, the outliers found in subspaces are only part of the true outliers in high dimensional space, indeed. The outliers hidden in normalclustered points are sometimes neglected i...

متن کامل

Outliers in dynamic factor models

Dynamic factor models have a wide range of applications in econometrics and applied economics. The basic motivation resides in their capability of reducing a large set of time series to only few indicators (factors). If the number of time series is large compared to the available number of observations then most information may be conveyed to the factors. This way low dimension models may be es...

متن کامل

Incremental Local Evolutionary Outlier Detection for Dynamic Social Networks

Numerous applications in dynamic social networks, ranging from telecommunications to financial transactions, create evolving datasets. Detecting outliers in such dynamic networks is inherently challenging, because the arbitrary linkage structure with massive information is changing over time. Little research has been done on detecting outliers for dynamic social networks, even then, they repres...

متن کامل

SubRank: Ranking Local Outliers in Projections of High-Dimensional Spaces

Outlier mining has become an increasingly urgent issue in the KDD process, since it may be the case that finding exceptional events is more interesting than searching for common patterns. These outliers are most relevant to be found for instance in fraud detection processes. Unfortunately, existing approaches do not take into account that increasing dimensionality leads to a novel understanding...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 1998